Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Analysis of factors affecting efficiency of data distributed parallel application in cloud environment
MA Shengjun, CHEN Wanghu, YU Maoyi, LI Jinrong, JIA Wenbo
Journal of Computer Applications    2017, 37 (7): 1883-1887.   DOI: 10.11772/j.issn.1001-9081.2017.07.1883
Abstract628)      PDF (795KB)(373)       Save
Data distributed parallel applications like MapReduce are widely used. Focusing on the issues such as low execution efficiency and high cost of such applications, a case analysis of Hadoop was given. Firstly, based on the analyses of the execution processes of such applications, it was found that the data volume, the numbers of the nodes and tasks were the main factors that affected their execution efficiency. Secondly, the impacts of the factors mentioned above on the execution efficiency of an application were explored. Finally, based on a set of experiments, two important novel rules were derived as follows. Given a specific volume of data, the execution efficiency of a data distributed parallel application could not be improved remarkably only by increasing the number of nodes, but the execution cost would raise on the contrary. However, when the number of tasks was nearly equal to that of the nodes, a higher efficiency and lower cost could be got for such an application. The conclusions are useful for users to optimize their data distributed parallel applications and to estimate the necessary computing resources to be rented in a cloud environment.
Reference | Related Articles | Metrics
Execution optimization policy of scientific workflow based on cluster aggregation under cloud environment
DUAN Ju, CHEN Wanghu, WANG Runping, YU Maoyi, WANG Shikai
Journal of Computer Applications    2015, 35 (6): 1580-1584.   DOI: 10.11772/j.issn.1001-9081.2015.06.1580
Abstract442)      PDF (783KB)(390)       Save

Focusing on the higher ratio of processor utilization and lower execution cost of a scientific workflow in cloud, a policy of execution optimization based on task cluster aggregation was proposed. First, the tasks were reasonably replicated and aggregated into several clusters. Therefore, the key tasks could be scheduled as early as possible. Then, the task clusters were aggregated again to facilitate the spare time among the tasks in the task cluster. The experimental results show that the proposed policy can improve the parallelism of workflow tasks, advance the earliest finish time of the whole workflow and it has a significant effect in improving the utilization ratio of processors and lowering the cost of workflow execution.

Reference | Related Articles | Metrics